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Abstract 

A spanner graph on a set of points in M. d contains a shortest path between any pair of points with length 
at most a constant factor of their Euclidean distance. A spanner with a sparse set of edges is thus a good 
candidate for network backbones, as desired in many practical scenarios such as the transportation network 
and peer-to-peer network overlays. In this paper we investigate new models and aim to interpret why good 
spanners 'emerge' in reality, when they are clearly built in pieces by agents with their own interests and 
the construction is not coordinated. Our main result is to show that the following algorithm generates a 
(1 + e)-spanner with a linear number of edges, constant average degree, and the total edge length as a 
small logarithmic factor of the cost of the minimum spanning tree. In our algorithm, the points build edges 
at an arbitrary order. When a point p checks on whether the edge to a point q should be built, it will build 
this edge only if there is no existing edge p'q' with p' and q' at distances no more than w^ptm • \p'q'\ 
from p, q respectively. Eventually when all points have finished checking edges to all other points, the 
resulted collection of edges forms a sparse spanner as desired. This new spanner construction algorithm 
can be extended to a metric space with constant doubling dimension and admits a local routing scheme to 
find the short paths. 

As a side product, we show a greedy algorithm for constructing linear-size well-separated pair decom- 
positions that may be of interest on its own. A well-separated pair decomposition is a collection of subset 
pairs such that each pair of point sets is fairly far away from each other compared with their diameters 
and that every pair of points is 'covered' by at least one well-separated pair. Our greedy algorithm selects 
an arbitrary pair of points that have not yet been covered and puts a 'dumb-bell' around the pair as the 
well-separated pair, repeats this until all pairs of points are covered. When the algorithm finishes, we show 
only a linear number of pairs is generated, which is asymptotically optimal. 

1 Introduction 



A geometric graph G defined on a set of points PCI" with all edges as straight line segments of weight 
equal to the length is called a Euclidean spanner, if for any two points p,q E P the shortest path between them 
in G has length at most A ■ \pq\ where \pq\ is the Euclidean distance. The factor A is called the stretch factor of 
G and the graph G is called a A-spanner. Spanners with a sparse set of edges provide good approximations for 
the pairwise Euclidean distances and are good candidates for network backbones. Thus, there has been a lot 
of work on the construction of Euclidean spanners in both the centralized setting ifTTl [39l and the distributed 
setting KT1 . 

In this paper we are interested in the emergence of good Euclidean spanners formed by uncoordinated 
agents. Many real-world networks, such as the transportation network, the Internet backbone network, the 
flight network, are good spanners — one can typically drive from any city to any other city in the U.S. 
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with the total travel distance at most a small constant times their straight line distance; similarly, one can 
find connecting flights between any two towns with the total flight distance not much more than the direct 
flight distance, possibly by flights of different airlines. The same thing happens with the Internet backbone 
graph as well. However, these large networks are not owned or built by any single authority. They are often 
assembled with pieces built by different governments (federal, states, or county governments), different airline 
companies, or different ISPs. These agents have their own agenda and interests. It does not seem to be the 
case that they have carefully planned the construction in collaboration before hand and clearly the edges are 
not built all at the same time. Nevertheless altogether they provide a convenient sparse spanner for the users 
that is close to the best spanner one can find in a centralized coordinated setting. The work in this paper is 
largely motivated by this observation and we would like to interpret and understand why a good Euclidean 
spanner is able to 'emerge' from these agents incrementally. 

From the application's perspective, we are interested in the construction of nice network overlay or in- 
frastructure topologies used in many distributed network services such as transportation network, peer-to-peer 
(P2P) file sharing and content distribution applications Poll . Such overlay or infrastructure networks are con- 
structed in a distributed manner without centralized control, to achieve robustness and scalability. The agents 
self-organize themselves in a overlay network topology by choosing other agents to connect to directly. While 
prior work in overlay design has focused on system robustness and network scalability, a significant amount 
of work in recent years have focused on reducing routing delay lfT2ll43ll34ll48ll49l . Most of these work adapt 
the current overlay topology to respect the topology of the underlying network and only show by experiments 
the improvement of the average delay. The fundamental question still yet to be answered is as follows ll43l : 
for a peer in the P2P network, what set of neighbors should it connect to, such that the shortest path rout- 
ing latency on the resultant overlay is low, compared with the minimum delay in the underlying network? 
Obviously a spanner graph would be a good solution for the overlay construction, yet there is no centralized 
authority in the P2P network that supervises the spanner construction and the peers may join or leave the 
network frequently (the system churn rate is high). The work in this paper initiates the study of the emergence 
of good spanners in the setting when there is little coordination between the agents and the users only need a 
modest amount of incomplete information of the current overlay topology. 

Our contribution. We consider in this paper the following model that abstracts the scenarios explained 
earlier. There are n points in the plane. Each point represents a separate agent and may build edges from itself 
to some other points by the strategy to be explained later. The edges in the final graph is the collection of 
edges built by all the agents. The agents may decide to build edges at different point in time. When an agent 
p plans on whether an edge from itself to another point q should be built or not, p checks to see whether there 
is already an edge from some points p' to q' such that \pp'\ and \qq'\ are both within m^fj^j • \p'q'\ from p 
and q respectively. If not, the edge pq is built, otherwise it is not. This strategy is very intuitive — if there is 
already a cross-country highway from Washington D.C. to San Francisco, it does not make economical sense 
to build a highway from New York to Los Angeles. We assume that each agent will eventually check on each 
possible edge from itself to all the other points, but the order on who checks which edge can be completely 
arbitrary. With this strategy, the agents only make decisions with limited information and no agent has full 
control over how and what graph will be constructed. It is not obvious that this strategy will end up with a 
sparse spanner on all points. It is even not clear that the graph will be connected. 

The main result in this paper is to prove that with the above strategy executed in any arbitrary order, the 
graph built at the end of the process is a sparse spanner graph with the following properties: 

• Between any two points p, q, there is a path with stretch 1 + e and 0(|pgj 1 ^ 1+2 / £ - ) ) hops. 

• The number of edges is 0(n). 

• The total edge length of the spanner is 0([MST| • log a), where a is the aspect ratio, i.e., the ratio of 
the distance between the furthest pair and the closest pair, and jMSTj is the total edge length of the 
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minimum spanning tree of the point set. Clearly |MST| is a lower bound on the total edge length of any 
constant stretch spanner. 

• The degree of each point is 0(log a) in the worst case and O(l) on average. 

An example of such a spanner is shown in Figure[T]in the Appendix. The above results for Euclidean space can 
be extended to a metric with constant doubling dimension. We also show that the spanner can be constructed 
for n agents with the help of a near neighbor oracle such that 

• Only a total number of 0(n log a) messages need to be exchanged between the agents during the entire 
construction process. 

• The spanner topology is implicitly stored on the nodes with each node's storage cost bounded by 
O(logo). It turns out with this representation the nearest neighbor of each node is included in the 
information stored on each node. 

• And yet simply with the partial information stored at each node, there is a local distributed algorithm 
that finds a (1 + e)-stretch path with maximum 0(|pg| 1 /( 1+2 / £ )) hops between any two nodes. 

To explain how this result is proved, we first obtain as a side product the following greedy algorithm for 
computing a well-separated pair decomposition of the points with optimal size. A pair of two sets of points, 
(A, B) with A, B as subsets of V, is called s-w ell-separated if the smallest distance between any two points 
in A, B respectively is at least s times greater than the diameters of A and B. An s-w ell-separated pair 
decomposition (s-WSPD for short) is a collection of s-well-separated pairs W = {{A i: Bi)} such that for any 
pair of points p,q G V there is a pair (A, B) 6 W with p G A and q G B. The size of an s-WSPD is the 
number of point set pairs in W. Well-separated pair decomposition (WSPD) was first introduced by Callahan 
and Kosaraju [El and they developed algorithms for computing an s-WSPD with linear size for points in M. d . 
Since then WSPD has found many applications in computing /c-nearest neighbors, n-body potential fields, 
geometric spanners and approximate minimum spanning trees ll5ll6ll8ir7ll3ll2l l38ll35ll25l[T8l . 

So far there are two algorithms for computing optimal size WSPD, one in the original paper [8 ] and one in 
a later paper |[23l . Both of them use a hierarchical organization of the points (e.g., the fair split tree in [El and 
the discrete center hierarchy in ll23l ) and output the well-separated pairs in a recursive way. In this paper we 
show the following simple greedy algorithm also outputs an s-WSPD with linear size. In particular, we take 
an arbitrary pair of points p, q that is not yet covered in any existing well-separated pair, and consider the pair 
of subsets (B r (p), B r {q)) with r = \pq\/{2s + 2) and B r (p) (B r (q)) as the set of points of V within distance 
r from p (q). Clearly (B r (p), B r (q)) is an s-well-separated pair and now all the pairs of points (p', q') with 
p' G B r (p) and q' G B r (q) are covered. The algorithm continues until all pairs of points are covered. We 
show that, no matter in which order the pairs are selected, the greedy algorithm will always output a linear 
number of well-separated pairs. 

The key idea in proving the linear size WSPD generated by the greedy algorithm is to show that at most 
a constant number of the generated well-separated pairs can be mapped to each well-separated pair generated 
by the deformable spanner 11231 . a data structure that has found many applications in proximity search with 
efficient update algorithm in both the kinetic [23] and dynamic settings P4l l24l . It has been shown in ll23l 
that the deformable spanner implies a WSPD with linear size. Thus the greedy algorithm also finds a linear 
number of pairs. The greedy WSPD also has a number of nice properties (not necessarily carried by the 
WSPD constructed in [8J) as to be shown in more details later. The greedy WSPD algorithm may be of 
interest by itself. 

Well-separated pair decomposition is deeply connected to geometric spanners. In fact, any WSPD will 
generate a spanner graph if one puts an edge between an arbitrary pair of points p, q from each well-separated 
pair (A, B) G W (3j |2l [38] [321. The number of edges in the spanner equals to the size of W. In the other 
direction, the deformable spanner in ll23l implies a WSPD of linear size. The connection is further witnessed 
in this paper — in our spanner emergence algorithm, each agent p constructs an edge to q only when there 
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is no nearby edge connecting points near p to points near q, this simple rule implies a WSPD generated in a 
greedy manner. Hence our spanner construction in an uncoordinated manner inherits many nice properties of 
the greedy WSPD. 

Last, this paper focuses on the case when the points are distributed in the Euclidean space. The basic idea 
extends naturally to metrics with constant doubling dimensions lT26ll40ll42l . as the main technique involves 
essentially various forms of geometric packing arguments. 

Related work. The model and the philosophy in this paper are related to the network creation game |[T9l[T3l 
[32l[TJ[37l. Fabrikant et al. fl9l was the first to introduce the network creation game, in order to understand 
the evolution of network topologies by selfish agents. The model used there (and in follow-up papers) assigns 
a cost function to each agent that captures the cost paid by the agents to build connections to others minus the 
benefit received from the resulted network topology. The agents play a game by minimizing their individual 
costs. Almost all these papers use a unit cost for each edge and they deviate in how the benefit of 'being 
connected to others' is modeled. These papers are interested in the existence of Nash equilibria and the price 
of anarchy of Nash equilibria. There are two major open questions along this direction. First, the choice of cost 
functions is heuristic — often some intuitive cost functions are selected. There is little understanding on what 
cost function best captures the reality yet small variation in the cost function may result in big changes in the 
network topologies at Nash equilibria. It is also not easy to execute the game in practice — either because the 
selfish agents may face deadlines and have to decide on building an edge or not immediately and sometimes 
the edges already built cannot be removed later (e.g., in the development of the transportation network by 
different local governments), or because the agents do not have the big picture and the current strategies of 
all other agents (e.g., in the P2P setting). The second problem is that there is not much understanding of the 
topologies at Nash equilibria. Some of the topologies at Nash equilibria in these papers are very simplistic 
topologies such as trees or complete graphs (and these topologies do not show up often in real world). For 
other more sophisticated topologies, there is not much understanding of their characteristics and therefore it 
is not clear whether these topologies are desirable. Our model is connected to the game theoretic model in 
the way that we also try to relax the requirement of a centralized authority in the graph construction, yet we 
also incorporate practical considerations that may not allow the agents from playing games to reach a Nash 
equilibrium. We believe such models and good algorithms under these models worth further exploration and 
this paper makes a first step along this line. 

In the vast amount of prior literature on geometric spanners, there are three main ideas: G -graphs, the 
greedy spanners, and the WSPD-induced spanners ll39l . We will review two spanner construction ideas that 
are most related to our approach. The first idea is the path-greedy spanner construction |[TTl[T4l[T5l[T6l . All 
pairwise edges are ordered with non-decreasing lengths and checked in that order. An edge is included in the 
spanner if the shortest path in the current graph is longer than A times the Euclidean distance, and is discarded 
otherwise. Variants of this idea generate spanners with constant degree and total weight 0(|MST|). This idea 
cannot be applied in our setting as edges constructed in practice may not be in non-decreasing order of their 
lengths and in a P2P network with high churn rate it is too much overhead to compute the shortest path length 
in the current overlay network (while checking the distance between two nodes, i.e., the path length in the 
underlying network topology, can be done easily with a TraceRoute command). The second idea is to use 
the gap property ifTTTl — the sources and sinks of any two edges in an edge set are separated by a distance 
at least proportional to the length of the shorter of the two edges and their directions are differed no more 
than a given angle. The gap-greedy algorithm [4] considers pairs of points, again, in order of non-decreasing 
distances, and includes an edge in the spanner if and only if it does not violate the gap property. The spanner 
generated this way has constant degree and total weight 0(|MST|). Compared with our algorithm, our strategy 
is a relaxation of the gap property in the way that the edges in our spanner may have one of their endpoints 
arbitrarily close (or at the same points) and we have no restriction on the direction of the edges. The proof 
techniques are also quite different. The proof for the gap greedy algorithm requires heavily plane geometry 
tools and our proof technique only uses packing argument and can be extended to the general metric setting as 
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long as a similar packing argument holds. To get these benefit our algorithm has slightly worse upper bounds 
on the spanner weight by a logarithmic factor. 

Spanner construction for metric space of constant doubling dimension has been proposed before (9l [TOj, 
[29l . These algorithms are centralized. 

Organization. In the rest of the paper we first elaborate the spanner construction in an uncoordinated man- 
ner and then show the connection of the spanner with the greedy WSPD. We then show the nice properties of 
both the greedy WSPD and our spanner. At the end, we describe how to apply the spanner in a decentralized 
setting to support low-storage spanner representation and efficient local low-stretch routing. 



2 Uncoordinated spanner construction and a greedy algorithm for WSPD 

Assuming n points in W 1 , each point represents an agent. We consider the following algorithm for constructing 
a sparse spanner with stretch factor s in an uncoordinated way. For any point p, denote by B r (p) the collection 
of points that are within distance r from point p, i.e., inside the ball with radius r centered at p. 

Uncoordinated spanner construction. Each point/agent p will check to see whether an edge from itself 
to another point q should be constructed or not. At this point there might be some edges already constructed 
by other agents. The order of which agent checks on which edge is completely arbitrary. Specifically, p 
performs the following operation: 

Check where there is already an edge p'q' such that p and q are within distance 2 (^+i) from p', q' respec- 
tively. If so, p does not build the edge to q. Otherwise, p will build an edge to q. 

This incremental construction of edges is executed by different agents in a completely uncoordinated 
manner. We assume that no two agents perform the above strategy at exactly the same time. Thus when 
any agent conducts the above process, the decision is based on the current network already constructed. 
The algorithm terminates when all agents finish checking the edges from themselves to all other points. In 
this paper we first study the properties of the constructed graph G by these uncoordinated behaviors. We 
will discuss later in Section [5] a proper complexity model for the uncoordinated construction in a distributed 
environment and also bound the computing cost of this spanner. 

Before we proceed with our proof, we first realize the following invariant is maintained by the graph G. 
The proof follows immediately from the construction of G. 

Lemma 2.1. I. For any edge pq that is not in G, there is another edge p'q' in G such that \pp'\ < 
\p'q'\/(2s + 2),\qq'\ < \p'q'\/(2s + 2). 

2. For any two edges pq, p'q' in the constructed graph G, suppose that pq is built before p'q', then one of 
the following is true: \pp'\ > \pq\/(2s + 2) or \qq'\ > \pq\/(2s + 2). 

To show that the algorithm eventually outputs a good spanner, we first show the connection of G with the 
notion well-separated pair decomposition. 

Definition 2.2 (Well-separated pair). Let s > be a constant, and a pair of sets of points A, B is s- 
separated, if d(A,B) > s ■ max(diam(A) , diam(B)) , where diam(A) is the diameter of the point set A, 
diam(A) = max \pq\, and d(A, B) = min \pq\. 

p,q&A p£A,q£B 

Definition 2.3 (Well-separated pair decomposition). Let s > be a constant, and V be a point set. An 
s-well-separated pair decomposition (WSPD) ofV is a set of pairs W = {(At, B\), . . . , (A m , B m )}, s.t. 

1. Ai,Bi C P, and the pair sets Ai and Bi are s-separated for every i. 

2. For any two points p,q G V, there is at least one pair (Ai, Bi) such thatp G Ai and q G Bi. 
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Here m is called the size of the WSPD. 



In our construction of G, it is not so hard to see that a well-separated pair decomposition is actually 
implied. 

Theorem 2.4. From the uncoordinated construction of the graph G, we can build the following s-well- 
separated pair decomposition W: for each edge pq in G, include in W the pair (B r (p), B r {q)), with r = 
\pq\/(2s + 2). The size of the WSPD is the number of edges in G. 

Proof: First each pair (B r (p), B r (q)) is an s-well-separated pair. Obviously, d(B r (p), B r (q)) > \pq\ — 2r, 

and dmm(B r (p)), diam(B r (q)) < 2r. One can then verify that d(B r (p), B r (q)) > s-max(diam(B r (p)),diam(B r (q))). 

We now show that any point p, q is included in one well-separated pair. If the edge pq is in the graph 
the claim is true obviously. Otherwise, there is an edge p'q' in G such that \pp'\ < \p'q'\/(2s + 2), \qq'\ < 
\p'q'\/(2s + 2), by LemmaO This means thatp G B r j(p') and q G B r >{q') with r' = \p'q'\/(2s + 2). This 
finishes the proof. □ 

A greedy algorithm for well-separated pair decomposition. The above theorem shows the connection of 
the uncoordinated graph G with a WSPD W. In fact, the way to compute the WSPD W via the construction 
of G is equivalent to the following algorithm that computes an s-WSPD, in a greedy fashion, with s > 1. 

1. Choose an arbitrary pair (p, q), not yet covered by existing well-separated pairs in W. 

2. Include the pair of point sets B r (p) and B r (q) in the WSPD W, with r = \pq\/(2 + 2s). 

3. Label every point pair (pi, qi) with pi G B r (p) and qi G B r (q) as being covered. 

4. Repeat the above steps until every pair of points is covered. 

With the ,s-WSPD W, the uncoordinate construction of the graph G is in fact by taking an edge from each 
and every well-separated pair in W — the simple rule in Lemma |2~T1 prevented two edges from the same well- 
separated pair in W to be constructed. It is already known that for any well-separated pair decomposition, if 
one edge is taken from each well-separated pair, then the edges will become a spanner on the original point 
set (3l 121 [38] [3H. For our specific greedy s-WSPD, we are able to get a slightly better stretch as shown in the 
theorem below. 

Theorem 2.5. Graph G constructed from the greedy s-WSPD is a spanner with stretch factor (s + l)/(s — 1) . 

Proof: Denote by ir(p, q) the shortest path length between p, q in the graph G. We show that 7r(p, q) < \pq\ 
for any p,q G P, with j3 = (s + 1) /(s — 1). We prove this claim by induction on the distance between two 
points p, q. Take p, q as the closest pair of P. Then any s-WSPD will have to use a singleton pair (p, q) 
to cover the pair (p, q) if s > 1. If otherwise, say (P, Q) is an s-well-separated pair that covers (p, q), and 
\P\ > 1. Thendiam(P) > \pq\, and d(P, Q) = \pq\. This contradicts with the fact that d(P, Q) > s-diam(P). 
Therefore the edge pq is included in G for sure and 7r(p, q) = \pq\. 

Now suppose that for all pairs of nodes x, y with Euclidean distance \xy\ < £, we have ir(x, y) < f3- \xy\. 
Now we consider the pair of nodes p, q with the smallest distance (among all remaining pairs) that is still 
greater than I. (p, q) is covered by an s-well-separated pair (P, Q) G W, where P = B r (p') and Q = B r (q') 
with r = \p'q'\/{2s + 2) and p'q 1 an edge in G. Now we argue that \pp'\ < £. If otherwise, \pq\ > d(P, Q) > 
s ■ diam(P) > s • \pp'\ > \pp'\. So we should have selected the pair (p,p') instead of (p, q). Similarly, 
\qq'\ < £. Thus by induction hypothesis vr(p,p') < (3 ■ \pp'\, ^{q,q') < (i ■ \qq'\- By triangle inequality, 
we have Tr(p, q) < n(p,p') + \p'q'\ + ir(q, q') < (5 ■ {\pp'\ + \qq'\) + \p'q'\ < 2(3 ■ r + \p'q'\. On the other 
hand, we know by triangle inequality that \pq\ > \p'q'\ —2r = • \p'q'\. Combining everything we get that 

n(p, q) < (^fj + 1) • ^ ■ \pq\ = (3 ■ \pq\, with f3 = (s + l)/(s - 1). This finishes the proof. □ 

To make the stretch factor as 1 + e, we just take s = l + 2/ein our spanner construction. We also want to 
show that the spanner is sparse and has some other nice properties useful for our applications. For that we 
will first show that the greedy WSPD algorithm will output a linear number of well-separated pairs. 
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3 The uncoordinated spanner has linear size 



To show that the WSPD by the greedy algorithm has a linear number of pairs, we actually show the connection 
of this WSPD with a specific WSPD constructed by the deformable spanner ||23l , in the way that at most a 
constant number of pairs in W is mapped to each well-separated pair constructed by the deformable spanner. 
To be consistent, in the following description, the greedy WSPD is denoted by W and the WSPD constructed 
by the deformable spanner is denoted by VV. 

3.1 Deformable spanner and WSPD 

In this section, we review the basic definition of the deformable spanner and some related properties, which 
will be used in our own algorithm analysis in the next subsection. 

Given a set of points V in the plane, a set of discrete centers with radius r is defined to be the maximal 
set S C V that satisfies the covering property and the separation property: any point p € V is within distance 
r to some point p' G S; and every two points in S are of distance at least r away from each other. In other 
words, all the points in V can be covered by balls with radius r, whose centers are exactly those points in the 
discrete center set S. And these balls do not cover other discrete centers. 

We now define a hierarchy of discrete centers in a recursive way. So is the original point set V. Si is 
the discrete center set of Sj_i with radius 2*. Without loss of generality we assume that the closest pair has 
distance 1 (as we can scale the point set and do not change the combinatorial structure of the discrete center 
hierarchy). Thus the number of levels of the discrete center hierarchy is lg a, where a is the aspect ratio of 
the point set V, defined as the ratio of the maximum pairwise distance to the minimum pairwise distance, 
that is, a = max \uv\/ min Since a point p may stay in multiple consecutive levels and correspond to 

multiple nodes in the discrete center hierarchy, we denote by pW the existence of p at level i. For each point 
_i on level i — 1, it is within distance 2 l from at least one other point on level i. Thus we assign 
to p(* _1 ) a parent in Si such that |p( l ~ 1 )q( t ) | < 2\ When there are multiple points in Si that cover p^ % ~ l \ 
we choose one as its parent arbitrarily. We denote by P(p( 4-1 )) the parent of p^~^ on level i. We denote by 
_pW (p) = p(p( l ~ l ) (p)) the ancestor of p at level i. 

The deformable spanner is based on the hierarchy, with all edges between two points u and v in Si if 
\uv\ < c • 2\ where c is a constant equal to 4 + 16/e. 

We remark that Krauthgamer and Lee ll33l independently proposed a very similar hierarchical structure 
for proximity search in metrics with doubling dimension ll26l . 

Now we will restate some important properties of the deformable spanner that will be useful in our algo- 
rithm analysis. 

Lemma 3.1 (Packing Lemma [23 1). In a point set S C R d , if every two points are at least distance r away 
from each other, then there can be at most (2R/r + l) d points in S within any ball with radius R. 

Lemma 3.2 (Deformable spanner properties Il23l ). For a set ofn points in R d with aspect ratio a, 

1. For any point p £ So, its ancestor pW (p) g Si is of distance at most 2 l+1 away from p. 

2. Any point p e Sj has at most (1 + 2c) d — 1 edges with other points of Si. 

3. The deformable spanner G is a (1 + e) -spanner with 0{n/e d ) edges. 

4. G has total weight 0(\MST\ ■ lga/e d+1 ), where \MST\ is the weight of the minimal spanning tree of 
the point set S. 

As shown in |[23l . the deformable spanner implies a well-separated pair decomposition VV by taking all 
the 'cousin pairs'. Specifically, for a node pW on level i, we denote by Pj the collection of points that are 
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decedents of pW (including pM itself). Now we take the pair (Pi,Qi), the sets of decedents of a cousin 
pair pW and q( l \ i.e., pW and <jrW are «o? neighbors in level i but their parents are neighbors in level i + 
This collection of pairs constitutes a | -well-separated pair decomposition. The size of VV is bounded by the 
number of cousin pairs and is shown to be in the order of 0{n/e d ). 

3.2 Greedy well-separated pair decomposition has linear size 

With the WSPD VV constructed by the deformable spanner, we now prove that the greedy WSPD W has 
lineal - size as well. The basic idea is to map the pairs in W to the pairs in VV and show that at most a constant 
number of pairs in VV map to the same pair in VV. 

Theorem 3.3. The greedy s-WSPD VV has size 0(ns d ). 

Proof: Suppose that we have constructed a deformable spanner DS with c = 4(s + 1) and obtained an s- 
well-separated pair decomposition (WSPD) of it, call it VV, where s = c/4 — 1. The size of VV is 0{ns d ). 
Now we will construct a map that takes each pair in VV and map it to a pair in VV. 

Each pair (P,Q) in W is created by considering the points inside the balls B r (p), B r (q) with radius 
r = \pq\/(2 + 2s) around p, q. Now we consider the ancestors of p, q in the spanner DS respectively. There is 
a unique level i such that the ancestor itj = PW (p) and t?j = PW (q) do not have a spanner edge in between 
but the ancestor Ui+x = P^ l+l \p) and Vi + \ = P^ l+l \q) have an edge in between. The pair Ui, t?j is a cousin 
pair by definition and thus their decedents correspond to an s-well-separated pair (Pj, Qi) in VV. We say that 
the pair (B r (p), B r (q)) € W maps to the descendant pair (Pj, Qj) € VV. 

By the discrete center hierarchy (Lemma 13.21) and that Ui,Vi do not have an edge in the spanner, i.e., 
\uiVi\ > c • 2*, we show that, 

|p<?| > \uiVi\ - \pui\ - \qvi\ > \uiVi\ - 2 ■ 2 i+1 > (c - 4) • 2\ 

Also, since m+i,Vi+i have an edge in the spanner, \ui + \Vi + i | < c • 2 l+1 , 

\pq\ < Ipui+il + l^+i^+xl + l^+il <2-2 i+2 +c-2 i+1 = 2(c + 4)-2 i . 

Similarly, we have 

c- 2 i < \uiVi\ < \u iUi+1 \ + |iti+iu i+ i| + \viVi+i\ < 2 ■ 2 i+l + c-2 i+l = 2(c + 2) • 2\ 

Therefore the distance between p and q is d • \uiVi\, where (c — 4) /(2c + 4) < d < (2c + 8)/c. 

Now suppose two pairs (P ri (pi), B ri (q\)), (B r2 (p2), B r2 (q2)) in W map to the same pair m and Vi by 
the above process. Without loss of generality suppose that pi,qi are selected before P2,<?2 in our greedy 
algorithm. Here are some observations: 

1. |pi<7i| = d x ■ \uiVi\, |p 2 <? 2 | = d 2 -\uiVi\, n = |pigi|/(2+2s) = d 1 -\u i v i \/(2 + 2s), r 2 = d 2 -\uiVi\/ {2 + 
2s), where (c — 4) /(2c + 4) < d x , d 2 < (2c + 8)/c, and n, r 2 are the radii of the balls for the two pairs 
respectively. 

2. The reason that (p 2 , q 2 ) can be selected in our greedy algorithm is that at least one of p 2 or q 2 is outside 
the balls P(pi), B(qi), by Lemma I27TI This says that at least one of p2 or q 2 is of distance n away 
from pi, (ft. 

Now we look at all the pairs (pe, qg) that map to the same ancestor pair (uj, vi). The pairs are ordered 
in the same order as they are constructed, i.e., p\, q\ is the first pair selected in the greedy WSPD algorithm. 
Suppose r m j n is the minimum among all radius rj. r m i n = \pq\ m in / {2+ s) > (c— 4)/(2c+4)-c-2*/(2+2s) = 
s/(2s + 3) • 2 i+1 . We group these pairs in the following way. The first group Hi contains (pi, and all the 
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pairs (pz, qg) that have pi within distance r m i n /2 from p\. We say that (pi , q±) is the representative pair in Hi 
and the other pairs in Hi are close to the pair (pi, qi). The second group H2 contains, among all remaining 
pairs, the pair that was selected in the greedy algorithm the earliest, and all the pairs that are close to it. We 
repeat this process to group all the pairs into k groups, Hi, H2, • • • , For all the pairs in each group Hj, 
we have one representative pair, denoted by (pj, qj) and the rest of the pairs in this group are close to it. 

We first bound the number of pairs belonging to each group by a constant with a packing argument. With 
our group criteria and the above observations, all pn in the group Hj are within radius r m i n from each other. 
This means that the q/s must be far away — the q£s must be at least distance r. m j n away from each other, by 
Lemma I27TI On the other hand, all the qgs are descendant of the node Vi, so \viqt\ < 2 %+l by Theorem 13.21 
That is, all the q/s are within a ball of radius 2 t+1 centered at Uj. By the packing Lemma |3~T1 the number of 
such qg's is at most (2 • 2 i+1 /r min + l) d < (2 • 2 m (2s + 3)/(s • 2 i+1 ) + l) d = (5 + 6/s) d . This is also the 
bound on the number of pairs inside each group. 

Now we bound the number of different groups, i.e., the value k. For the representative pairs of the k 
groups, (pi, qi), (p2, (72), ■ • • , {Pk,Qk), all the pj's must be at least distance r m j n /2 away from each other. 
Again these pi's are all descendant of U{ and thus are within distance 2* +1 from U{. By a similar packing 
argument, the number of such p^'s is bounded by (4 • 2 l+1 /r m j n + l) d < (9 + 12/s) d . So the total number 
of pairs mapped to the same ancestor pair in W will be at most (5 + 6/s) d ■ (9 + 12/s) rf = (0(1 + \/s)) d . 
Thus the total number of pairs in W is at most 0(ns d ). This finishes the proof. □ 

4 Size, degree, weight and diameter of the uncoordinated spanner 

With the result that the greedy WSPD has linear size in the previous section and the connection of the greedy 
WSPD with the uncoordinated spanner construction in Section^ we are able to obtain the following theorems. 
The proofs use various of packing arguments. 

Theorem 4.1. The uncoordinated spanner G with parameter s is a spanner with stretch factor (s + l)/(s — 1) 
and has 0(ns d ) number of edges. 

Proof: The number of edges in the spanner is the same as the size of the greedy WSPD W with the same 
parameter s constructed by selecting the same set of edges in the same order. □ 

Theorem 4.2. G has a maximal degree of Oilga ■ s d ) and average degree 0(s d ). 

Proof: With the same argument as in Theorem l3.3[ each pair (p, q) built in the uncoordinated spanner maps to 
a pair of ancestors (pW (p), pW (q)) in the deformable spanner that is a cousin pair. Consider all the edges of 
p in G, (p, qi), that map to the same ancestor pair (pW (p), pW (q)). By a similar argument, all the q/s must 
be at least distance r m j n away from each other (since all these pairs have p as the first element in the pair). 
Thus we have the number of such edges is bounded by (5 + 6/s) d . The number of cousin pairs associated with 
pW (p) is at most 5 d times the number of adjacent edges of p( t+1 ) (p), and is bounded by 5^ • [(8s + 9) d — 1] 
(by Theorem 13.21) . Since there are |_lg cej levels, the total number of edges associated with the node p is at 
most |lgaj ■ 5 d • [(8s + 9) d — 1] • (5 + 6/s) d . Then the maximal degree of the spanner is 0(lga • s d ). 

Since the spanner has total 0(ns d ) edges, the average degree is 0(s d ). □ 

Theorem 4.3. G has total weight 0{\ga ■ \MST\ ■ s d+1 ). 

Proof: Again we use the mapping of the uncoordinated spanner edges to the cousin pairs in the deformable 
spanner DS, as in Theorem [33] We also use the same notation here. Consider all the edges (p^, qi) that map 
to the same ancestor cousin pair (u,i,Vi). We now map them to the edge between the parents of this cousin 



10 



Figure 2. An example shows that we may achieve the tight bound 2\pq\ 1 ^ 1+lg ^ on the hop distance between point p and q. 

pair, i.e., edge Uj+x^i+i in DS. The pair (114+1, Vi+i) has at most 5 2rf number of cousin pairs. Thus at most 
(5 + 6/s) d ■ (9 + l2/s) d ■ 5 2d = (0(1 + l/s)) d edges in G are mapped to one edge in DS. 

Now we will bound the length of an edge pq in G and the edge Ui+\Vi+\ in DS it maps to. From the 
proof of Theorem 13.31 we know that (c — 4) • 2* < \pq\ < 2(c + 4) • 2\ In addition, |uj + i-Uj + i| < 2c • 2* as 
Ui + \Vi + i is an edge in DS, and |uj + ii>j + i| > — \ui + \Ui \ — > c ■ 2 1 — 2 • 2'' ,+1 = (c — 4) • 2*. 

Thus, (c-4)/(2c) < \pq\/\u i+1 v i+1 \ < 2(c + 4)/(c - 4). 

We now bound the total weight of the spanner G. We group all the edges by the spanner edge in DS that 
they map to. Thus we have the total weight of G is at most 2(c + 4)/(c — 4) • (0(1 + l/s)) d the weight of 
DS. By Theorem [3^21 the weight of DS is at most 0(\ga ■ |MST| • s d+1 ). Thus the weight of G is at most 
0(\ga • |MST| • □ 

Theorem 4.4. For any two points p and q in G, there is a path with stretch (s + l)/(s — 1) between p and q 
with atmost2\pq\ l / { - l+l? -^ hops. 

Proof: For any two point p and q, they will be covered by a pair set (P, Q) with respect to edge p'q' so that 
p 6 P and q € Q. The path p q between p and q can be found recursively by taking the path p ~^ p', 
then the edge p'q' , and then the path q' ~> g. This path found recursively will have stretch (a + l)/(s — 1) 
according to Theorem 12.5 1 

Obviously \pp'\ < r < \pq\/(2s). Denote by h(\pq\) the hop count of the path between p and q with 
stretch (s + l)/(s — 1). Thus we can get the following recurrence h(\pq\) = h(\pp'\) + 1 + h(\qq'\) < 
h(\pq\/(2s)) + 1 + h(\pq\/(2s)), that is, h(x) = 2h(x/(2s)) + 1. Solve this recurrence we get h(\pq\) = 
2 1 S2 S Ip?I+i = 2 1 slH/lg(2s)+i = 2\pq\ 1 / ( - 1+1 % s \ 

The analysis is tight as shown by Figure |2l In the example, we assume all the nodes almost lying on the 
line. The 2 balls with the same radius on the left side(or right side) is the pair that covers p(or q), and there is 
an edge between them(in order to see clearly, we draw curve line between them). The red bold balls are the 
balls that contain p{ox q on the left side). If we create these pairs in decreasing order on the radius, then we 
will have a path between p and q with exactly 2|pg| 1 /( 1+lgs ) hops. And we assume all the rest points are far 
away, then this is the only path between p and q. □ 



5 Spanner construction and routing 

The spanner construction is so far only defined on points in Euclidean space. Now, we will first extend our 
spanner results to a more general metric, metric with constant doubling dimension. The doubling dimension 
of a metric space (X, d) is the smallest value 7 such that each ball of radius R can be covered by at most 2 7 
balls of radius R/2 11261 . Metrics with constant doubling dimension appear in many different settings. For 
example, it has been discovered that the Internet delay metric has restricted growth rate and approximately 
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has a constant doubling dimension I40ll42l . As the main technique used in the spanner analysis is packing 
argument, the following Theorem is the extension of the spanner in the metric case. The proof is in the 
Appendix. 

Theorem 5.1. For n points and a metric space defined on them with constant doubling dimension 7, the 
uncoordinated spanner construction outputs a spanner G with stretch factor (s + l)/(s — 1), has total weight 
0(7 9 • lga • \MST\ ■ s°^) and has 0(j 4 ■ n ■ s°^) number of edges. Also it has a maximal degree of 
average degree 0( 7 4 • s°W). 

Proof: The proof follows almost the same as those in the previous section. The deformable spanner can be 
applied for metrics with constant doubling dimension [23]. Whenever we use a geometric packing argument, 
we replace by the property of metrics of constant doubling dimension. We just need to notice that, with the 
proposition from 11261 . the number of pairs in a group that might be mapped to the same ancestor is bounded 

by 7 rig(2K/r)l = 7 rig(2.2<+7( S /(2 S +3).2«+i))l = y +rig(2+3/s)1 < ^ ^ ±& rf . & bounded by 

7 flg(2H/(r/2))l = 7 rig(4.2 l + 1 /(^/(2s+3)-2 l + 1 ))l < ^5 > D 

Model of computing. Now we would like to discuss the computing model as well as the construction cost 
of an uncoordinated spanner. Each distributed agent checks whether an edge to another agent should be built 
or not. The order for these operations can be arbitrary. We assume that there is already an oracle that answers 
near neighbors queries: return the list of nodes within distance r from any node x. Such near neighbor oracle 
may be already available in the applications. For example, in transportation network, if we think each city as 
an agent, it usually stores the nearby cities and the corresponding transportation road by them. For a metric 
with doubling dimension, this oracle can be implemented in a number of ways ll42l [331 l30l . For example, 
such functions can be implemented in Tapestry, a P2P system |[3~Tll50l . 

Spanner construction and representation. The spanner edges are recorded in a distributed fashion so that 
no node has the entire picture of the spanner topology. After each edge pq in G is constructed, the peers p, q 
will inform their neighboring nodes (those in B r (p) and B r (q) with r = \pq\/(2s + 2)) that such an edge pq 
exists so that they will not try to connect to one another. These neighboring nodes can be obtained through 
a near neighbor search from both p and q. We assume that these messages are delivered immediately so that 
when any edge is built the previous constructed edges have been informed to nodes of relevance. We remark 
that the nodes in B r (p) and B r (q) will only store the single edge pq, as well as their distance to p (for those in 
B r {p)) or q (for those in B r (q)) and the value of \pq\, but not all the nodes in B r (p) or B r (q). When a node x 
considers whether an edge xy should be built, x could simply consult with the edges that it has stored locally. 
The amount of storage at each node x is proportional to the number of well-separated pairs that include x. 
The number of messages for this operation is bounded by |P r (p)| + |P r (g)|. The following theorem shows 
that the total number of such messages during the execution of the algorithm is almost linear in n. That is, on 
average each node sends about 0(s d ■ lg a) messages. Also the amount of storage at each node is bounded by 
0(s d lga). 

Theorem 5.2. For the uncoordinated spanner G and the corresponding greedy WSPD W = {(Pi, Qi)} with 
size m, each node x is included in at most 0(s d lg a) well-separated pairs in W. 

Proof: For each point x, each well-separated pair that contains x can be mapped to some ancestor pair in 
some level in the corresponding deformable spanner. Let's consider the set pair (P, Q) (with respect to edge 
pq) that is mapped to the ancestor pair (pM (p) , pW (g)) = (ui,Vi) at level i with x € P. Now we map x to 
the node n, and we count how many such nodes on level i that a node x may map to. The corresponding 
radius with (p, q) is r = \pq\/(2 + 2s) < (1 + l/(s + 1))2* +2 . In this case, \xpi\ < \pui\ + \xp\ < 2' +1 + r < 
(3 + 2/(s + 1))2* +1 . According to Lemma [3721 different nodes in level i must be at least 2* away from each 
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other. Withapacking argument, there can be at most [((2-(3+2/(s + l)))-2 i+1 )/2' + l] d = (13 + 8/(s + l)) d 
different such nodes p,i on level i that x may map to. So the number of well-separated pairs that cover x is at 
most 

Sl=i S Pi (cousin pairs with pi | • (pairs in W mapping to a cousin pair with pi | 

< lga • (13 + 8/(a + l)) d • 5 d [(8s + 9) d - 1] • (0(1 + l/s) d ) 
= 0(s d lga). 

□ 

Theorem 5.3. For the uncoordinated spanner G and the corresponding greedy WSPD W = {(Pi, Qi)} with 
size m, YliLi(\Pi\ + \Qi\) — 0(ns d ■ lga). Thus the total messages involved in the spanner construction 
algorithm is 0(ns d - lga). 

Proof: This follows immediately from Theorem 15.21 as 

m 

y^(|Pj| + \Qi\) < IP airs that include x\ = 0(ns d ■ lga). 

i=l x 

□ 

Distributed low-stretch routing on spanner. Although the spanner topology is implicitly stored on the 
nodes with each node only knows some piece of it, we are actually able to do a distributed and local routing 
on the spanner with only information available at the nodes such that the path discovered has maximum stretch 
(s + l)/(s — 1). In particular, for any node p who has a message to send to node q, it is guaranteed that (p, q) 
is covered by a well-separated pair (B r (p'), B r (q'j) with p £ B r (p') and q S B r (q'). By the construction 
algorithm, the edge p'q', after constructed, is informed to all nodes in B r (p') U B r (q'), including p. Thus p 
includes in the packet a partial route with {p p ; ,p'q',q' q}- The notation p ^ p' means that p will 
need to first find out the low-stretch path from p to the node p' (inductively), from where the edge p'q 1 can be 
taken, such that with another low-stretch path to be found out from q' to q, the message can be delivered to 
q. This way of routing with partial routing information stored with the packet is similar to the idea of source 
routing B71 except that we do not include the full routing path at the source node. By the same induction as 
used in the proof of spanner stretch (Theorem l2.5l ). the final path is going to have stretch at most (s+ 1)/ (s — 1) 
and at most 2|p(7| 1 /( 1+lgs ) hops. 

Support for nearest neighbor search. The constructed spanner can be used to look for the nearest peer in 
the P2P network. Since we let each point x keep all the edges (p, q) that cover x, among all these p's, one of 
them must be the nearest neighbor of x. If otherwise, suppose y is the nearest neighbor of x, and y is not one 
of p. But in the WSPD W, (x, y) will belong to one of the pair set (P, Q), which corresponds to a spanner 
edge (p', q') that covers x. Then there is a contradiction, as \xp' j < diam(P) < d(P, Q)/s < d(P, Q) < \xy\ 
implies that y is not the nearest neighbor of x. According to Theorem l5.2l x will belong to at most 0(s d lg a) 
different pair sets. So the nearest neighbor search can be done in 0(s d lg a) time, with only the information 
stored on x. 

Node insertion and deletion. The uncoordinated spanner construction supports node insertion and deletion. 
When a peer x joins the network, it will check with each other peer whether or not a nearby edge exists as 
specified in our greedy algorithm. When a peer y leaves the network, p will notify the nodes that are covered 
by p's edges, i.e., for each edge pq, p will notify q, and all the nodes within \pq\/(2s + 2) from p and q. Then 
the notified nodes will check and possibly build new edges to restore the spanner property. In this way, the 
spanner and the good properties can be maintained. 
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6 Conclusion and future work 

This paper aims to explain the emergence of good spanners from the behaviors of agents with their own 
interests. The results can be immediately applied to the construction of good network overlays by distributed 
peers with incomplete information. For our future work we would like to explore incentive-based overlay 
construction |[20l . One problem faced in the current P2P system design is to reward peers that contribute to the 
network maintenance or service quality and punish the peers that try to take free rides |[2^l27ll28ll2Tll45ll46ll . 
We would like to extend the results in this paper and come up with a spanner construction with different 
quality of service for different peers to achieve fairness — those who build more edges should have a smaller 
stretch to all other nodes and those who do not build many edges are punished accordingly by making the 
distances to others slightly longer. 
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